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Abstract 

Many fundamental problems in artificial intelligence, knowledge representation, and ver- 
ification involve reasoning about sets and relations between sets and can be modeled as set 
constraint satisfaction problems (set CSPs). Such problems are frequently intractable, but 
there are several important set CSPs that are known to be polynomial-time tractable. We 
introduce a large class of set CSPs that can be solved in quadratic time. Our class, which we 
call ST, contains all previously known tractable set CSPs, but also some new ones that are 
of crucial importance for example in description logics. The class of £T set constraints has 
an elegant universal-algebraic characterization, which we use to show that every set constraint 
language that properly contains all £T set constraints already has a finite sublanguage with an 
NP-hard constraint satisfaction problem. 

An extended abstract of the content of this article appeared in the proceedings of IJCAFll |BHK11] . 

1 Introduction 

Constraint satisfaction problems are computational problems where, informally, the input consists 
of a finite set of variables and a finite set of constraints imposed on those variables; the task is 
to decide whether there is an assignment of values to the variables such that all the constraints 
are simultaneously satisfied. Set constraint satisfaction problems are special constraint satisfaction 
problems where the values are sets, and the constraints might, for instance, force that one set y 
includes another set a;, or that one set x is disjoint to another set y. The constraints might also 
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be ternary, such as the constraint that the intersection of two sets x and y is contained in z, in 
symbols (x n y) c 2. 

To systematically study the computational complexity of constraint satisfaction problems, it 
has turned out to be a fruitful approach to consider constraint satisfaction problems CSP(r) where 
the set of allowed constraints is formed from a fixed set F of relations R c over a common 
domain D. This way of parametrizing the constraint satisfaction problem by a constraint language 
r has led to many strong algorithmic resuhs [BD061 IIMM+ 10[ IBK09al IBK07[ IBKOQb] . and to many 
powerful hardness conditions for large classes of constraint satisfaction problems |Sch78[ IBKJOSi 
IBul03l[&II06l IBKOQb) ■ 

A set constraint language F is a set of relations R c where the common domain 

D = V{H) is the set of all subsets of the natural numbers; moreover, we require that each relation 
R can be defined by a Boolean combination of equations over the signature n, u, c, 0, and 1, which 
are function symbols for intersection, union, complementation, the empty and full set, respectively. 
Details of the formal definition and many examples of set constraint languages can be found in 
Section m The choice of N is just for notational convenience; as we will see, we could have selected 
any infinite set for our purposes. In the following, a set constraint satisfaction problem (set CSP) 
is a problem of the form CSP(F) for a set constraint language F. It has been shown by Marriott 
and Odersky |M096j that all set CSPs are contained in NP; they also showed that the largest set 
constraint language, which consists of all relations that can be defined as described above, has an 
NP-hard set CSP. 

Drakengren and Jonsson |DJ98) initiated the search for set CSPs that can be solved in polynomial 
time. They showed that CSP({c, ||, /}) can be solved in polynomial time, where 

• X c. y holds iff x is a subset of or equal to y; 

• X \ \ y holds iff x and y are disjoint sets; and 

• X ¥^ y holds iff x and y are distinct sets. 

They also showed that CSP(F) can be solved in polynomial time if all relations in F can be defined 
by formulas of the form 

7^ yi V • ■ • V 7^ yfc V xo e 2/0 

or of the form 

a;i # yi V • • ■ V Xfc # yfc V xo 1 1 yo 

where xq, . . . , x^, yo, . . . , y^ are not necessarily distinct variables. We will call the set of all relations 
that can be defined in this way Drakengren and Jonsson's set constraint language. It is easy to see 
that the algorithm they present runs in time quadratic in the size of the input. 

On the other hand, Drakengren and Jonsson [DJ98' show that if F contains the relations defined 
by formulas of the form 

Xi 7^ yi V • • • V Xfc # yfc V Wi 1 1 Ui V • • • V Mfc 1 1 Wfc 

the problem CSP(F) is NP-hard. 

Contributions and Outline. We present a significant extension of Drakengren and Jonsson's 
set constraint language (Section [3]) whose CSP can still be solved in quadratic time in the input 
size (Section [5]); we call this set constraint language £1. Unlike Drakengren and Jonsson's set 
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constraint language, our language also contains the ternary relation defined by {x ny) c z, which is 
a relation that is of particular interest in description logics - we will discuss this below. Moreover, 
we show that any further extension of £1 contains a finite sublanguage with an NP-hard set CSP 
(Section [7]), using concepts from model theory and universal algebra. In this sense, we present a 
maximal tractable class of set constraint satisfaction problems. 

Our algorithm is based on the concept of independence in constraint languages which was 
discovered several times independently in the 90 's iKou01[|JB98llMO96| - see also [B JRMIICJJKOO] : 
however, we apply this concept twice in a novel, nested way, which leads to a two level resolution 
procedure that can be implemented to run in quadratic time. The technique we use to prove the 
correctness of the algorithm is also an important contribution of our paper, and we believe that a 
similar approach can be applied in many other contexts; our technique is inspired by the already 
mentioned connection to universal algebra. 

Application Areas and Related Literature 

Set Constraints for Programming Languages. Set constraints find applications in program 
analysis; here, a set constraint is of the form X <^Y, where X and Y are set expressions. Examples 
of set expressions are (denoting the empty set), set-valued variables, and union and intersection 
of sets, but also expressions of the form f{Zi, Zi) where / is a function symbol and Zi, Zi are again 
set expressions. Unfortunately, the worst-case complexity of most of the reasoning tasks considered 
in this setting is very high, often EXPTIME-hard; see |Aik94| for a survey. More recently, it 
has been shown that the quantifier-free combination of set constraints (without function symbols) 
and cardinality constraints (quantifier-free Pressburger arithmetic) has a satisfiability problem in 
NP [KR07j . This logic (called QFBAPA) is interesting for program verification IKNR06j . 

Tractable Description Logics. Description logics are a family of knowledge representation 
formalisms that can be used to formalize and reason with concept definitions. The computational 
complexity of most of the computational tasks that have been studied for the various formalisms 
is usually quite high. However, in the last years a series of description logics (for example EL, 
EC^^ , Horn-J^Lo; and various extensions and fragments |KM02| IBaa03| IBBLOSj IKRH06| ) has 
been discovered where crucial tasks such as e.g. entailment, concept satisfiability and knowledge 
base satisfiability can be decided in polynomial time. 

Two of the basic assertions that can be made in £C^^ and Horn-J-"io are Ci||C2 {there is no Ci 
that is also C2) and Ci n C2 ^ Cs {every Ci that is C2 is also C3), for concept names Ci,C2,C3. 
These are £1 set constraints, and the latter has not been treated in the framework of Drakengren 
and Jonsson. None of the description logics with a tractable knowledge base satisfiability problem 
contains all £1 set constraints. 

Spatial Reasoning. Several spatial reasoning formalisms (like RCC-5 and RCC-8) are closely 
related to set constraint satisfaction problems. These formalisms allow to reason about relations 
between regions; in the fundamental formalism RCC-5 (see e.g. |JD97| ). one can think of a region as 
a non-empty set, and possible (binary) relationships are containment, disjointness, equality, overlap, 
and disjunctive combinations thereof. Thus, the exclusion of the empty set is the most prominent 
difference between the set constraint languages studied by Drakengren and Jonsson in |DJ98j (which 
are contained in the class of set constraint languages considered here), and RCC-5 and its fragments. 
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2 Constraint Satisfaction Problems 



To use existing terminology in logic and model theory, it will be convenient to formalize constraint 
languages as (relational) structures (see e.g. |Hod93| ). A structure F is a tuple {D; /f ,/]",..., i?^, i?2i ••• ) 
where D is a. set (the domain of T), each f[ is a function from Z?*^' D (where ki is called the 
arity of f[), and each i?f is a relation over D, i.e., a subset of D'* (where li is called the arity of 
i?f ). For each function ff we assume that there is a function symbol which we denote by fi, and 
for each relation we have a relation symbol which we denote by Ri. Constant symbols will be 
treated as 0-ary function symbols. The set r of all relation and function symbols for some structure 
F is called the signature of F, and we also say that F is a r-structure. If the signature of F only 
contains relation symbols and no function symbols, we also say that F is a relational structure. In 
the context of constraint satisfaction, relational structures F are also called constraint languages, 
and a constraint language F' is called a sublanguage (or reduct) of a constraint language F if the 
relations in F' are a subset of the relations in F (and F is called an expansion of F'). 

Let F be a relational structure with domain D and a finite signature r. The constraint sat- 
isfaction problem for F is the following computational problem, also denoted by CSP(F): given a 
finite set of variables V and a conjunction $ of atomic formulas of the form R{xi, . . . ,Xk), where 
xi, . . . ,Xk s V and R e t, does there exists an assignment s: V D such that for every constraint 
R{xi, . . . , Xk) in the input we have that (s(a;i), . . . , s{xk)) e R^"! 

The mapping s is also called a solution to the instance $ of CSP(F), and the conjuncts of $ are 
called constraints. Note that we only introduce constraint satisfaction problems CSP(F) for finite 
constraint languages, i.e., relational structures F with a finite relational signature. 

3 Set Constraint Languages 

In this section, we give formal definitions of set constraint languages. Let (3 be the structure with 
domain 7^(N), the set of all subsets of natural numbers, and with signature {n, u, c, 0, 1}, where 

• n is a binary function symbol that denotes intersection, i.e., n® = n; 

• u is a binary function symbol for union, i.e., u® = u; 

• c is a unary function symbol for complementation, i.e., c® is the function that maps S* c N 
to N\5; 

• and 1 are constants (treated as 0-ary function symbols) denoting the empty set and the 
full set N, respectively. 

Sometimes, we simply write n for the function n® and u for the function u®, i.e., we do not 
distinguish between a function symbol and the respective function. We use the symbols n , u and 
not the symbols n, u to prevent confusion with meta- mathematical usages of n and u in the text. 

A set constraint language is a relational structure with a set of relations with a quantifier-free 
first-order definition in (3. We always allow equality in first-order formulas, and the equality symbol 
= is always interpreted to be the true equality relation on the domain of the structure. 

Example 1. The ternary relation {(x, y, z) 6 7'(N)'^ | a: n y E z} has the quantifier-free first-order 
definition zr\(xny)=xr\y over (3. 
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Theorem 2 (Follows from Proposition 5.8 in [M096j ). Let T he a set constraint language with a 
finite signature. Then CSP(r) is in NP. 

It is well-known that the structure u, n, c, 0, 1) is a Boolean algebra, with 

• playing the role of false, and 1 playing the role of true; 

• c playing the role of — 

• n and u playing the role of a and v , respectively. 

To not confuse logical connectives with the connectives of Boolean algebras, we always use the 
symbols n, u, and c instead of the usual function symbols a, v, and — ■ in Boolean algebras. To 
facilitate the notation, we also write x instead of c(a;), and x ¥^ y instead of — '(x = ?/). 

We assume that all terms t over the functional signature {n,u,c, 0,1} are written in (inner) 
conjunctive normal form ( CNF), i.e., as i = n"=i LJj=i ^ij where Uj is either of the form x or of the 
form X for a variable x (every term over {n, u, c, 0, 1} can be re- written into an equivalent term of 
this form, using the usual laws of Boolean algebras jBoo47| ). We allow the special case n = (in 
which case t becomes 1), and the special case = (in which case Uj becomes 0). We refer 

to Ci := {lij I 1 ^ J ^ Ui] as an (inner) clause of and to lij as an (inner) literal of c^. We say 
that a set of inner clauses is satisfiable if there exists an assignment from V P{N) such that for 
all inner clauses, the union of the evaluation of all literals equals N (this is the case if and only if 
the formula t = 1 has a satisfying assignment). 

We assume that all quantifier-free first-order formulas (/> over the signature {n, u,c, 0, 1} are 
written in (outer) conjunctive normal form (CNF), i.e., as (/) = ^™ Vj^i where Lij is either 
of the form t = 1 (a positive (outer) literal) or of the form t 7^ 1 (a negative (outer) literal). Again, 
it is well-known and easy to see that we can for every quantifier-free formula find a formula in this 
form which is equivalent to it in every Boolean algebra. We refer to C'i := {Lij \ 1 ^ j TOj} as 
an (outer) clause of (f), and to Lij as an (outer) literal of Ci. Whenever convenient, we identify (j) 
with its set of clauses. 

4 £1 Set Constraints 

To define £1 set constraints, we need to introduce a series of important functions defined on the 
set of subsets of natural numbers. 

Definition 3. Let 

• i: (:P(N))2 V{N) be the function that maps {Si, S2) to the set {2x \ x e Si} u {2x + I \ x e 
S2}; 

9 F he the function that maps 5* c N to the set of finite non-empty subsets of S; 

9 G: N ^ F(N) be a bijection between N and the set of finite non-empty subsets of N (since 
both sets are countable, such a bijection exists); 

• e: P{N) ^ P{N) be defined by 

eiS) = {G'\T)\TeFiS)}; 
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• ei be the function defined by ei(x,y) i— > e(i{x,y)). 

Definition 4. Let /: {r{N))'' P{N) be a function, and R c be a relation. Then 

we say that / preserves R if the fohowing holds: for all a^,...,a*' 6 {V{H)Y we have that 
(/(a}, . . . , a^), . . . , f{a\, . . . , of)) e i? if a' e i? for all i ^ k. If / does not preserve R, we also 
say that / violates R. We say that / strongly preserves R if for all a^,. . . e (^'(N))' we have 
that {f{a\, . . . , a'l), . . . , f{aj, . . . , af)) e i? if and only if a* e i? for all i ^ fc. If </> is a first-order 
formula that defines a relation R over 6, and / preserves (strongly preserves) R, then we also say 
that / preserves (strongly preserves) (p. Finally, if g: (P(N))' — > V{N) is a function, we say that / 
preserves (strongly preserves) g if it preserves (strongly preserves) the graph of g, i.e., the relation 
{{xi, . . .,xi,g{xi, . . .,xi)) I xi, . . . ,Xi c N}. 

Note that if an injective function / preserves a function g, then it must also strongly preserve 

9- 

Definition 5. The set of all relations with a quantifier-free first-order definition over 6 that are 
preserved by the operation ei is denoted by £1. 

Remark. We will see later fProposition[55land ProDOsition[55|) that the class £2 is independent 
from the precise choice of the operations e and i. 

Proposition [1^ shows that £X has a large subclass, called Horn-Horn, which has an intuitive 
syntactic description. In Section [S] we also present many examples of relations that are from £X 
and of relations that are not from £1. Before, we will establish some properties of the functions i 
and e. 

Fact 6. The mapping i is an isomorphism between &^ and &. 

Proof. The mapping i can be inverted by the mapping that sends 5 c N to ({x | 2x e 5*}, {x \ 2x+l e 
S*}). It is straightforward to verify that i strongly preserves 0, 1, c, u, n. □ 

We write a; E y as an abbreviation for x n y = x. 

Proposition 7. The function e has the following properties. 

u e is injective, 

• e strongly preserves 1, 0, and n , and 

• for x,y, z e V(N) such that xuy = z, not x ^ y, and not y ^ x, we have that e{x)ue{y) ^ e{z). 

Proof. We verify the properties one by one. Since G is bijective, e{x) = e(y) if and only if x and y 
have the same finite subsets. This is the case if and only if a; = y, and hence e is injective. Thus, 
to prove that e strongly preserves 1, 0, and n, it suffices to check that e preserves 1, 0, and n. 

Since G is bijective, we have that G(N) equals the set of all finite subsets of N, and hence 
e(N) = N, which shows that e preserves 1. We also compute e(0) = G^^{F{0)) = G^^{0) = 0. 

Next, we verify that for all x,y e V{N) we have e{x) n e{y) = e{x n y). Let a e N be arbitrary. 
We have a e e{x) n e{y) if and only if G(a) e F{x) n F{y). By definition of F and since G(a) is 
a finite subset of N, this is the case if and only if G{a) e F{x n y). This is the case if and only if 
a e e(x n y), which concludes the proof that e preserves n. 
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We verify that if x u y = z, not x ^ y, and not y ^ x, then e{x) u e{y) ^ e{z). First 
observe that for all u,v N with u E u we have e{u) c e(w) since e preserves n. This implies 
that e{x) u e(y) E e{z). Since x % y and y $ x, there are a, 6 such that a e x, a ^ y, b e y, 

X. Then we have that {a,b) e F(^), but {a, 6} ^ u -F(?/). Hence, G~^{{a,b}) e e(2;), but 
G~^{{a,b]) ^ e{x) u e(j/). This shows that e{z) ^ e{x) u e{y). □ 

Note that in particular e preserves c, and ||. Moreover, e{c{x)) E c(e(x)): this follows from 
preservation of ||, since a:||c(x), and therefore e(x)||e(c(a;)), which is equivalent to the inclusion 
above. Both e and i strongly preserve n, 0, and 1, and therefore also ei strongly preserves n, 0, 
and 1. 



5 Horn-Horn Set Constraints 

A large and important subclass of £1 set constraints is the class of Horn-Horn set constraints. 
Definition 8. A quantifier-free first-order formula is called Horn-Horn if 

1. every outer clause is outer Horn, i.e., contains at most one positive outer literal, and 

2. every inner clause of positive outer literals is inner Horn, i.e., contains at most one positive 
inner literal. 

A relation R c V{nf is called 

• outer Horn if it can be defined over S by a conjunction of outer Horn clauses; 

• inner Horn if it can be defined over 6 by a formula of the form (ci n • • • n Cfe) = 1 where 
each Ci is inner Horn; 

• Horn-Horn if it can be defined by a Horn-Horn formula over &. 

The following is a direct consequence of the fact that isomorphisms between F*^ and F preserve 
Horn formulas over F; since the simple proof is instructive for what follows, we give it here for the 

special case that is relevant here. 

Fact 9. Outer Horn relations are preserved by i. 

Proof. Let be a conjimction of outer Horn clauses with variables V. Let {to = l,ti 1, ■ ■ ■ .tk ¥= 
1} be an outer clause of (p. Let u,v: V 'P(N) be two assignments that satisfy this clause. Let 
w: V V{N) be given by a; i-^ i{u{x),v{x)). Suppose that w satisfies tj = 1 for all 1 j k. 
Since i is injective we must have that tj = 1 for both u and v for I ^ j ^ k, and therefore 
neither assignment satisfies the negative literals. Hence, u and v must satisfy to = 1. Since i is 
an isomorphism between &^ and &, it preserves in particular to = 1, and hence w also satisfies 
to = l. □ 

Proposition 10. Inner Horn relations are strongly preserved by e. 

Proof. Observe that J'u (|_J^ y^) = 1 is equivalent to .I'n (flj Vj) = flj Vjj which is strongly preserved 
by e since e strongly preserves n. This clearly implies the statement. □ 
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Note that Fact [5] and Proposition [TUl imply that ei strongly preserves inner Horn relations. We 
later also need the following. 

Lemma 11. Let xi, . . . ,Xk,yi, ■ ■ ■ ^yi ^ N, where k ^ I. Then the following are equivalent. 

1. e{xi) u ■ • ■ u e(xfc) u e{yi) u ■ • ■ u e{yi) = 1. 

2. there exists an i ^ k such that Xi u (LJ^yj ) = 1- 

3. there exists an i ^ k such that e{xi) u (|_|^ ^{Uj)) = 1- 

For k = 0, we have that [J^- ?7j<j; = 1 */ and only i/|_|j<; e{yj) = 1. 

Proof. For the implication from (1) to (2), suppose that there is for every i ^ fc an G N such 
that flj ^ Xi := Xi u Let c be G~^({ai,a2, . . . Then for each i ^ k, we have that 

c ^ e{xi) u Ujsci ^iVj)- To see this, first observe that e yj n Xi. Therefore, {ai, . . . , Ok} e 

\~\j<^i F{yj) r~i F{xi) for all i ^ k. We conclude that c ^ e{xi) u ■ • • u e{xk) u e(2/i) u • • • u e{yi). 

The implication (2) =^ (3) follows directly from Proposition [TOl The implication (3) ^ (1) is 
trivial. The second statement is a direct consequence of Proposition [101 D 

Proposition 12. Every Horn-Horn relation is preserved by e and i; in particular, it is from £1 . 

Proof. Suppose that R has a Horn-Horn definition over 6 with variables V. Since R is in 
particular outer Horn, it is preserved by i by Fact [5] 

Now we verify that R is preserved by e. Let u: V ^ V^N) be an assignment that satisfies 
(p. That is, u satisfies at least one literal in each outer clause of 4>. It suffices to show that the 
assignment v: V V^N) defined hy x e{u{x)) satisfies the same outer literal. Suppose first that 
the outer literal is positive; because is Horn-Horn, it is of the form x uyi u ■ ■ ■ uyi = 1 or of the 
form yi u ■ ■ ■ uyi = 1, which is preserved by e by Lemma 1111 

Now, suppose that the outer literal is negative, that is, of the form xiu - ■ - uxkuyiu - ■ - uyl ¥= 1 
for some k ^ 0. We will treat the case fc > 1, the other case being similar. Suppose for contradiction 
that v{xi) u • • • u v{xk) u v{yi) u • • • u v{yi) = 1. By Lemma ll 11 there exists an i k such that 
u{xi)u(\_\ju{yj)) = 1. But then we have in particular that u ••• uu(a;fc) uM(yi) u ••■ uM(y/') = 
1, in contradiction to the assumption that u satisfies (j). □ 

Examples. 

1. The disjointness relation || is Horn-Horn: it has the definition x u y = 1. 

2. The inequality relation # is inner Horn; it has the definition {y u x) n (x u y) 1. 

3. Using the previous example, the relation {(x,y,u,v) \ x¥=yvu = v} can easily be seen to be 
Horn-Horn. 

4. The ternary relation {{x,y, z) | x n y c z}, which we have encountered above, has the Horn- 
Horn definition x u y u z = 1. 

5. Examples of relations that are clearly not Horn- Horn: {{x,y) \ x u y = 1} is violated by e, 
and {{x, y, z) \ {x = y) y {y = z)] is violated by i. 
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6. The formula 

{x n y x) 
A {x ny ^y) 

A {v = 1 V u = 1 V X u y ^ 1) 

is clearly not Horn-Horn. However, the relation defined by the formula is from £1: if 
(xi, yi, ui, U2) und (a;2, ^2, W2, i'2) are from that relation, then neither i{xi,X2) E i{yi,y2) 
nor 1(2/1,2/2) ^i{xi,X2). By Proposition [3 (ei(a;i, a;2), ei(?/i, ?/i), ei(ui, U2), ei(wi, 'y2)) satisfies 
the formula. 

There is no equivalent Horn- Horn formula, since the formula is not preserved by i. 

7. The formula [[x uy^\)\/{uuv = 1)) a (x u ?/ 7^ 1) a (a; u j/ 7^ 1) is not Horn-Horn. 
However, it is preserved by e and by i: the reason is that one of its clauses has the negative 
literal x u y ^ 1, and the conjuncts {x u y ^ 1} and {x u y 1}. Therefore, for every tuple 
teR the tuple e(t) satisfies x uy ^ 1 and is in R as well. By Fact[9l R is preserved by i. 

In this case, the authors suspect that there is no equivalent Horn-Horn formula. More gen- 
erally, it is an open problem whether there exist formulas that are preserved by e and i, but 
that are not equivalent to a Horn-Horn formula. 

Proposition 13. Drakengren and Jonsson's set constraint language only contains Horn-Horn re- 
lations. 

Proof. For inclusion a; ^ y, disjointness ||, and inequality ¥= this has been discussed in the examples. 
Horn-Horn is preserved under adding additional outer disequality literals to the outer clauses, so 
all relations considered in Drakengren and Jonsson's language are Horn-Horn. □ 

We prepare now some results that can be viewed as a partial converse of Proposition 1121 

Definition 14. A quantifier-free first-order formula 4> (in the syntactic form described at the end 
of Section |3]) is called reduced if if every formula obtained from (j) by removing an outer literal is 
not equivalent to 4> over &. 

Lemma 15. Every quantifier-free formula is over & equivalent to a reduced formula. 

Proof. It is clear that every quantifier-free formula can be written as a formula (f) in CNF and in 
the form as we have discussed it after Theorem [51 We now remove successively outer literals as 
long as this results in an equivalent formula. □ 

We first prove the converse of Fact [9l 

Proposition 16. Let <j) be a reduced formula that is preserved by i. Then each outer clause of (j) is 
Horn. 

Proof. Let V be the set of variables of (p. Assume for contradiction that (f> contains an outer clause 
with two positive literals, ti = 1 and t2 = 1. If we remove the literal ti = 1 from its clause C, the 
resulting formula is inequivalent to cj), and hence there is an assignment si: V V{N) that satisfies 
none of the literals of C except for = 1. Similarly, there is an assignment S2'. V ^ ■p(N) that 
satisfies none of the literals of C except for t2 = 1. By injectivity of z, and since i strongly preserves 
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c, n,u, and 1, the assignment s: V ^ 'P(N) defined by x ^ i{si{x), S2ix)) does not satisfy tfie 
two literals ii = 1 and t2 = 1- Since i strongly preserves c, u, n, none of the other literals in C is 
satisfied by those mappings as well, in contradiction to the assumption that (f) is preserved by i. □ 

Definition 17. Let be a set of variables, and s: V ^ ■p(N) be a mapping. Then a function from 
V —>■ V{N) of the form x i-* e{s{x)) is called a core assignment. 

Lemma 18. For every quantifier-free formula (j) there exists a formula ^ such that all inner clauses 
are inner Horn, and such that (j) and ^ have the same satisfying core assignments. If 4> is preserved 
by ei, then the set of all satisfying core assignments of tp is closed under ei. 

Proof. Suppose that (p has an outer clause C with a positive outer literal t = 1 such that t contains 
an inner clause c := xi u • • • u Xfc u u • ■ • u y; that is not Horn, i.e., k ^ 2. Then we replace the 
outer literal i = 1 in by fc literals ti = 1, . . . , = 1 where ti is obtained from t by replacing c by 
Xi uyi u ■ ■ ■ uyi. 

We claim that the resulting formula 4>' has the same set of satisfying core assignments. Observe 
that Xi ulji u ■ ■ ■ ulji E c, and hence ti = 1 implies t = 1. An arbitrary satisfying assignment of 4>' 
satisfies either one of the positive outer literals ti = 1, in which case that observation shows that it 
also satisfies 0, or it satisfies one of the other outer literals of C, in which case it also satisfies this 
literal in </>. Hence, (p' implies (p. Conversely, let s be a satisfying core assignment of </>. If s satisfies 
a literal from C other than t = 1, then it also satisfies this literal in cf)', and s satisfies (p'. Otherwise, 
s must satisfy t = 1, and hence s{xi) u • • • u s{xk) u s{yi) u • • • u s{yi) = 1. Since s is a core 
assignment. Lemma [TT] implies that there exists an i ^ fc such that s{xi) u s(yi) u • • • u s{yi) = 1. 
So s satisfies (j)'. 

Suppose that (p has an outer clause C with a negative outer literal t 1 such that t contains an 
inner clause c := xiu - ■ -uxkuljiU - ■ -ulji that is not Horn, i.e., k ^ 2. Then we replace the clause 
C in (p hy k clauses Ci , . . . , Ck where Ck is obtained from C by replacing c with u y u ■ • • u y; . 

We claim that the resulting formula cp' has the same set of satisfying core assignments. Observe 
that xi u ■ ■ ■ u Xk u Iji u ■ ■ ■ u Iji 1 implies that Xi u Iji u ■ ■ ■ u Iji ¥= 1, for every i ^ k. The 
observation shows that an arbitrary assignment of (p is also an assignment of cp' . Conversely, let s 
be a satisfying core assignment of cp'. If s satisfies one of the other literals of C other than t ^ 1, 
then s satisfies (p. Otherwise, s must satisfy Xi ulji u ■ ■ ■ ulji 1 for all i ^ k, and by Lemma [Til 
we have that s also satisfies xi u • • • u x^; u u • • ■ u y; # 1. 

We perform these replacements until we obtain a formula (p' where all inner clauses are Horn; 
this formula satisfies the requirements of the first statement of the lemma. 

To prove the second statement, let u,v : V ^ P(N) be two satisfying core assignments of (p' . 
Since cp' and (p have the same satisfying core assignments, u and v also satisfy (p. Then the mapping 
w: V ^ P{N) given hy x ^ ei{u{x),v{x)) is a core assignment, and because ei preserves <pj the 
mapping w satisfies (p. Since (p ^-nd (p' have the same core assignments, w is also a satisfying 
assignment of (p' , which proves the statement. □ 

Definition 19. A quantifier-free first-order formula (p (in the syntactic form described at the end 
of Section [S]) is called strongly reduced if every formula obtained from (p by removing an outer literal 
does not have the same set of satisfying core assignments over &. 

Proposition 20. Let (f> be a strongly reduced formula all of whose inner clauses are Horn. If the 
set of satisfying core assignments of (p is closed under ei, then (p is Horn-Horn. 
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Proof. Let V be the set of variables of (p. It suffices to show that all clauses of </> are outer Horn. 
Assume for contradiction that (j) contains an outer clause with two positive literals, ti = 1 and 
t2 = 1- If we remove the literal ti = 1 from its clause C, the resulting formula has strictly less 
satisfying core assignments; this shows the existence of a core assignment si : V — ► 'P(N) that 
satisfies none of the literals of C except for ii = 1. Similarly, there exists a core assignment 
S2- V —* 'P(N) that satisfies none of the literals of C except for t2 = 1- By assumption, the 
inner clauses of ti and t2 are Horn. We claim that the assignment s: V ^ P(N) defined by 
X 1-^ ei{si{x),S2{x)) does not satisfy the clause C. Since ei strongly preserves inner Horn clauses, 
we have that s does not satisfy ti = 1 v ^2 = 1- For the same reasons s does not satisfy any 
other literals in C; this contradicts the assumption that the satisfying core assignments for (j) are 
preserved by ez. □ 

Proposition 21. Let T be a finite set constraint language from £1 . Then CSP(r) can he reduced 
in linear time to the problem to find a satisfying assignment for a given set of Horn-Horn clauses. 

Proof. Let $ be an instance of CSP(r), and let V be the set of variables that appear in $. For each 
constraint -R(xi, . . . ^Xk) from $, let 4>r be the definition of R over S. By Lemma [T51 there exists 
a formula that has the same satisfying core assignments as ipji and where all inner clauses are 
Horn; moreover, since (/>fl is preserved by ei, the lemma asserts that the set of all satisfying core 
assignments of ij^n is preserved by ei. We can assume without loss of generality that is strongly 
reduced; this can be seen similarly to Lemma fTSl By Proposition [20l the formula V'i? is Horn-Horn. 

Let Vl/ be the set of all Horn- Horn clauses of formulas -fAflCa^i, ■ • ■ , a^fc) obtained from constraints 
R{xi, . . . ,Xk) in $ in the described manner. We claim that $ is a satisfiable instance of CSP(r) 
if and only if vj/ is satisfiable. This follows from the fact that for each constraint R{xi, . . . ,Xk) 
in $, the formulas and ipn have the same satisfying core assignments, and that both (pu and 
ipR are preserved by ei (for ipn this follows from Proposition [T2|) . so in particular by the function 
X 1-^ ei(x, x). □ 

Note that in Proposition[2I]we reduce satisfiability for £1 to satisfiability for a proper subclass of 
Horn-Horn set constraints: while for general Horn-Horn set constraints we allow that inner clauses 
of negative outer literals are not Horn, the reduction only produces Horn-Horn clauses where all 
inner clauses are Horn. 



6 Algorithm for Horn-Horn Set Constraints 

We present an algorithm that takes as input a set $ of Horn- Horn clauses and decides satisfiability 
of over & = {V{N); u, n, c, 0, 1) in time quadratic to the length of the input. By Proposition!^ 
this section will therefore conclude the proof that CSP(r) is tractable when all relations in F are 
from £1. 

We first discuss an important sub-routine of our algorithm, which we call the inner resolu- 
tion algorithm. As in the case of Boolean positive unit resolution |DG84] one can implement the 
procedure Inner-Res such that it runs in linear time in the input size. 

Lemma 22. Let ^ be a finite set of inner Horn clauses. Then the following are equivalent. 

1. \~\^ = 1 is satisfiable over &. 

2. Inner-Res{^) from Figure{^ accepts. 
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Inner-Res($) 

/ / Input: A finite set <E> of inner Horn clauses 
/ / Accepts iff [~1 $ = 1 is satisfiable 
During tlie entire algoritlim: 

if $ contains an empty clause, then reject. 
Repeat :— true 
While Repeat = true do 

Repeat := false 

If $ contains a positive unit clause {x} then 
Repeat := true 

Remove all clauses where the literal x occurs. 

Remove the literal x from all clauses. 
End if 
Loop 
Accept 



Figure 1: Inner Resolution Algorithm. 

3. PI $ = 1 has a solution whose image is contained in {0,N}. 

Proof. It is obvious that P|$ = 1 is unsatisfiable when Inner-Res(<I') rejects; in fact, for all inner 
clauses c derived by Inner- Res from $, the formula c = 1 is logically implied by P| $ = 1. Conversely, 
if the algorithm accepts then we can set all eliminated variables to N and all remaining variables 
to 0, which satisfies all clauses: in the removed clauses the positive literal is satisfied, and in the 
remaining clauses we have at least one negative literal at the final stage of the algorithm, and all 
clauses with negative literals at the final stage of the algorithm are satisfied. □ 

The proof of the previous lemma shows that P| <& = 1 is satisfiable over 6 if and only if P| $ = 1 
is satisfiable over the two-element Boolean algebra. As we will see in the following, this holds more 
generally (and not only for inner Horn clauses). The following should be well-known, and can be 
shown with the same proof as given in |Kop89| for the weaker Proposition 2.19 there. We repeat 
the proof here for the convenience of the reader (for definitions of the notions appearing in the 
proof, however, we refer to |Kop89| ). 

Fact 23. Let ti,t2 be terms over {n, u,c, 0, 1}. Then the following are equivalent: 

1. = 1 A ^2 7^ 1 is satisfiable over the two-element Boolean algebra; 

2. ti = \ A t2 is satisfiable over all Boolean algebras; 

3. ^1 = 1 A ^2 1 is satisfiable in a Boolean algebra. 

Proof. Obviously, 1 implies 2, and 2 implies 3. For 3 implies 1, assume that ti = 1 a t2 ^ 1 has a 
satisfying assignment in some Boolean algebra £. Let c be the element denoted by t2 in £ under 
this assignment. It is well-known that every element a # of a Boolean algebra is contained in an 
ultrafilter (see e.g. Corollary 2.17 in |Kop89| ). So let U be an ultrafilter of £ that contains c, and 
let / : £ ^ {0, 1} be the characteristic function of U. Then / is a homomorphism from £ to the 
two-element Boolean algebra that maps c to 0; thus ti = 1 a i2 1 is satisfiable over {0, 1}. □ 
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The same statement for ti = 1 instead oi ti = 1 a t2 ^ 1 is Proposition 2.19. in |Kop89| . 
Fact [23] has the following consequence that is crucial for the way how we use the inner resolution 
procedure in our algorithm. 

Lemma 24. Let ^ be a finite set of inner Horn clauses. Then Inner- Res{'i'Kj{xi, . . . ,Xk, J/Oi • ■ • i Ui}) 
rejects if and only if = 1 implies that xi u ■ ■ ■ u ulji u ■ ■ ■ uyi = 1 over &. 

Proof. PI ^E* = 1 implies that xi u ■ ■ ■ u u Iji u ■ ■ ■ u Iji = 1 if and only if p| 5* = 1 a xi u 
■■■uxi^uyiU-'-ulji 1 is unsatisfiable over &. By Fact I23[ this is the case if and only if 
\~\'^ = 1 A xi u ■ ■ ■ u Xk ulji u ■ ■ ■ ulji ¥= 1 is unsatisfiable over the 2-element Boolean algebra, 
which is the case if and only if P| ^' = 1 a xi u ■ • • u Xfc u u • • • u = is unsatisfiable over 
the two-element Boolean algebra. As we have seen in Lemma [22l this is turn holds if and only if 
Inner-Res(^'u{xi, . . . ,Xk,yi, . ■ . ,yi}) rejects. □ 



Outer-Res("l>) 

/ / Input: A finite set <1> of Horn-Horn clauses 

/ / Accepts iff "I> is satisfiable over (P(N); n, u, c, 0, 1) 

During the entire algorithm: 

if $ contains an empty clause, then reject. 
Repeat := true 
While Repeat — true do 

Repeat := false 

Let be the set of all inner Horn clauses of terms t 

from positive unit clauses {t = 1} in 

If Inner-Res rejects ^, then reject. 
For each negative literal t =^ 1 m clauses from "1> 

For each inner clause D — , . . . , X}^ ^y^^ . . . ^yj^ I oft 
Call Inner-Res on 

* u {^1 = 1, . . . ,^fe = l,yo = 1, • • • = 1} 
If Inner-Res rejects then remove clause D from t 
End for 

If all clauses in t have been removed, then 
Remove outer literal t ^ 1 from its clause 
Repeat := true 
End for 
Loop 
Accept 



Figure 2: Outer Resolution Algorithm. 

Theorem 25. The algorithm 'Outer-Res' in Figure\^ decides satisfiability for sets of Horn-Horn 
clauses in quadratic time. 

Proof. We first argue that if the algorithm rejects $, then $ has indeed no solution. First note that 
during the whole argument, the set of clauses $ has the same satisfying tuples (i.e. the corresponding 
formulas are equivalent): Observe that only negative literals get removed from clauses, and that a 
negative literal t ^ 1. only gets removed from a clause when Inner- Res rejects ^I^ u {xi = 1, . . . , aJfc = 
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1,2/0 = Ij • ■ • : y; = 1} for each inner clause {xi, . . . , Xk,yi, ■ ■ ■ ,yi} oi t. By Lemma [Ml if Inner- Res 
rejects \E'u{a;i = 1, . . . , = 1, j/o = Ij • • • , ?/; = 1} then imphes that xiu- ■ -uxkuljiU- ■ -uy; = 1. 
Hence, the positive unit clauses imply that t = 1 and therefore the literal t 7^ 1 can be removed 
from the clause without changing the set of satisfying tuples. Now the algorithm rejects if either 
Inner- Res rejects 5" or if it derives the empty clause. In both cases it is clear that $ is not satisfiable. 

Thus, it sufHces to construct a solution when the algorithm accepts. Let be the set of all inner 
clauses of terms from positive unit clauses at the final stage, when the algorithm accepts. For each 
remaining negative outer literal {t 7^ 1} and each remaining inner clause D = {xi, . . . ,Xk,yi, ■ ■ ■ ,yi} 
oit there exists an assignment ao from V —>■ V(N) that satisfies u • ■ • ux^ uyj^ u • • • uy/ ¥= 1}: 

otherwise, by Lemma [24j the inner resolution algorithm would have rejected \1/ u {xi = 1, . . . ,Xk = 
1,2/0 = 1, ■ • • , y; = l}j and would have removed the inner clause D from t. Let Di, . . . , Dg be an 
enumeration of all remaining inner clauses D that appear in all remaining negative outer literals. 

Write is for the s-ary operation defined by (xi, . . . , Xg) 1-^ i{xi,i{x2, ■ ■ ■ , i{xs-i,Xs) • • ■)) (where 
i is as in Fact [6]). We claim that s:V^ V{H) given by 

X ^ isiaoiix), . . . ,a_D^(a;)) 

satisfies all clauses in $. Let C be a clause from $. By assumption, at the final stage of the 
algorithm, the clause C is still non-empty. Also note that since all formulas in the input were Horn- 
Horn, they contain at most one positive literal. This holds in particular for C, and we therefore 
only have to distinguish the following cases: 

• At the final state of the algorithm, C still contains a negative literal i 7^ 1. Since t 7^ 1 
has not been removed, there must be a remaining inner clause D = {xi, . . . ,Xk,yi, ■ ■ ■ ^iJi} of 
t. Observe that s{xi) u • • • u s{xk) u s(t/i) u • • • u s{yi) = 1 if and only if aoj (xi) u • • • u 
c^Djixk) L-i ctDjivi) u ■ • ■ u aujivi) = 1 for all 1 ^ j ^ s. Hence, and since aij{xi) u • • • u 
a£)(xfc) u aoiyi) li ■ • • u aoiyi) 1, s satisfies t ^ 1. This shows that s satisfies C. 

• All negative literals have been removed from C during the algorithm. The positive literal 
to = 1 of C is such that the inner clauses of are Horn. They will be part of 4', and 
therefore tg = 1 is satisfied by s. Indeed, by assumption the assignments satisfy 4', and 
4* is preserved by i. 

We conclude that s is a solution to The inner resolution algorithm has a linear time complexity; 
the outer resolution algorithm performs at most a linear number of calls to the inner resolution 
algorithm, and it is straightforward to implement all necessary data structures for outer resolution 
to obtain a running time that is quadratic in the input size. □ 

Combining Proposition [21] with Theorem 1251 we obtain the following. 

Theorem 26. Let T be a finite set constraint language from £1 . Then CSP(r) can be solved in 
quadratic time. 

7 Maximality 

In this section we show that the class £1 is a maximal tractable set constraint language. More 
specifically, let F be a set constraint language that strictly contains all £1 relations. We then 
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show that r contains a finite set of relations T' such that aheady the problem CSP(r') is NP-hard 
(Theorem go]). 

In our proof we use the so-called universal-algebraic approach to the complexity of constraint sat- 
isfaction problems, which requires that we re-formulate set CSPs as constraint satisfaction problems 
for uj- categorical structures. For a more detailed introduction to the universal-algebraic approach for 
w-categorical structures, see |Bodl2j . A structure F with a countable domain is called io- categorical 
if all countable structures that satisfy the same first-order sentences as F are isomorphic to F (see 
e.g. |Hod93) l. By the theorem of Ryll-Nardzewski, and for countable signatures, this is equivalent 
to requiring that every relation that is preserved by the automorphism^^ of F is first-order definable 
in F (see e.g. |Hod93| ). The set of all automorphisms of F is denoted by Aut(F). 

It is well-known that all countable atomies^ Boolean algebras are isomorphic (Corollary 5.16 
in |Kop89| ; also see Example 4 on page 100 in |Hod93) ): let 21 denote such a countable atomless 
Boolean algebra. Let A denote the domain of 21. Again, we use n and u to denote join and meet 
in 2t, respectively. Since the axioms of Boolean algebras and the property of not having atoms 
can all be written as first-order sentences, it follows that 21 is w-categorical. A structure 58 has 
quantifier elimination if every first-order formula is over *8 equivalent to a quantifier-free formula. 
It is well-known that 2t has quantifier elimination (see Exercise 17 on Page 391 in }Hod93) ). We 
will also make use of the following. 

Theorem 27 (Corollary 5.7 in [M096| ). A quantifier-free formula is satisfiable in some infinite 
Boolean algebra if and only if it is satisfiable in all infinite Boolean algebras. 

A fundamental concept in the complexity theory of constraint satisfaction problems is the notion 
of primitive positive definitions. A first-order formula is called primitive positive (pp) if it is of the 
form 

(-01 A • ■ • A -0 

where for each i ^ m the formula ipi is of the form R{yi, . . . ,yi) or of the form yi = 7/2, and 
where i? is a relation symbol and yi, ?/2, ■ • ■ , yz are either free variables or from {xi, . . . , a;„}. We 
say that a fc-ary relation R c D'^ is primitive positive definable (pp definable) over a r-structure F 
with domain D iff there exists a primitive positive formula (j){xi, . . . ,Xk) with the k free variables 
xi, . . . ,Xk such that a tuple (61, . . . , bk) is in R if and only if 0(6i, . . . , bk) is true in F. 

Example. The relation {{x,y) e ■p(N)^ | a; c y} is pp definable in (■p(N); 5, 7^) where S = 
{(x,?/, z) I X n y E z}. The pp definition is S{x,x,y) a x ¥= y (the definition is even quantifier- 
free). □ 

Example. The relation {{xi,X2, X3,y) e V{Ny | xi n X2 n cca E y} is pp definable in {V{N); S) 
where S = {(x, y, z) | x n y E z}. The pp definition is 3u (^(xi, X2, u) a S{u, X3, y)). □ 

When every relation of a structure F is preserved by an operation /, then / is called a polymor- 
phism of F. Note that polymorphisms of F also preserve all relations that have a pp definition in F. 
The following has been shown for finite domain constraint satisfaction in (BKJ05j : the easy proof 
also works for infinite domain constraint satisfaction. 

Lemma 28. Let R be a relation with a primitive positive definition in a structure F. Then CSP(F) 
and the CSP for the expansion of F by the relation R are polynomial-time equivalent. 

^An isomorphism of a structure F with itself is called an automorphism of F. 

■^An atom in a Boolean algebra is an element x ^ such that for all y with x n y = y and x ^ y we have y = 0. 
If a Boolean algebra does not contains atoms, it is called atomless. 
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The following theorem is one of the reasons why it is useful to work with w-categorical templates 
(when this is possible). 

Theorem 29 (from [BN06] ). Let T be an lo- categorical structure. Then R is primitive positive 
definable in T if and only if R is preserved by all polymorphisms ofT. 

The previous and the next result together can be used to translate questions about primitive 
positive definability into purely operational questions. Let D be a set, let 0^"^ be D" D, and 
let O be lJn=i the set of operations on D of finite arity. An operation n e Of^"^ is called a 
projection if for some fixed i e {1, . . . , n} and for all n-tuples {xi, . . . , x„) e Z?" we have the identity 
7r(xi, . . . , Xn) = Xi. The composition of a fc-ary operation / and k operations gi, . . . ,gk of arity n 
is the n-ary operation defined by 

{f{9l,---,9k)){xi,...,Xn) 
= f{9l{xi, ...,Xn),---,gk{xi,..., X„)) . 

Definition 30. We say that T ^ O locally generates f : -D" ^ £> if for every finite subset A of Z?" 
there is an operation g: £>" D that can be obtained from the operations in F and projection 
maps by composition such that /(a) = g{a) for all a e A. 

Theorem 31 (see [Sze86j ). Let J- ^ O be a set of operations with domain D. Then an operation 
f : D preserves all finitary relations that are preserved by all operations in J- if and only if 

J- locally generates f. 

In the following, we always consider sets of operations J" that contain Aut(2t), and therefore 
make the following convention. For F ^ O, we say that F generates f € O \i F kj Aut(2t) locally 
generates /. We now define analogs of the operations e and i, defined on A instead of V{H). 

Proposition 32. There is an isomorphism i between and 21. 

Proof. It is straightforward to verify that 21^ is again a countable atomless Boolean algebra. □ 

Motivated by the properties of e described in Lemma llll we make the following definition. 

Definition 33. Let *B and *B' be two arbitrary Boolean algebras with domains B and B' , respec- 
tively, and let 5 : _B ^ _B' be a function that strongly preserves n, 0, and 1. We say that g forgets 
unions if for all /c ^ 1, Z 5= 0, and Xi, . . . , Xk,yi, ■ ■ . ,yi s B we have 

e{xi) u ■ • ■ u e{xk) u e(yi) u • • • u e{yi) = 1 

if and only if there exists an i ^ fc such that Xi uyi u ■ ■ ■ uyj = 1. 

Proposition 34. There exists an injection e: A A that strongly preserves n, 0, and 1 in 21, 
and that forgets unions. 

Proof. The construction of e is by a standard application of Konig's tree lemma for w-categorical 
structures (see e.g. |Bodl2) ): it suffices to show that there is an injection / from every finite induced 
substructure *B of 21 to 2t such that / strongly preserves n, 0, and 1, and forgets unions. 

So let *B be such a finite substructure of 21, and let B be the domain of *B. Let £ = 
(^(S); n, u, c, 0, 1) be the Boolean algebra of the subsets of B. We claim that g: B ^ T^{B) 
given by 17(1) = 1 and g{x) = {z\z^Oaz E*^ x} for x # 1 
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• preserves and 1: this is by definition; 

• preserves n: for x,y e B (including the case that a; = 1 or y = 1) we have 

g{x) n*^* g{y) = {z | z 7^ a z x a z E*^ y} 
= {z|z7^0azc» (a;n'»2/)} 
= g{x n*^ y) ; 

• is injective: ii x,y € B such that g{x) = g{y), then x y and y x, and hence x = y; 

• strongly preserves n: this follows from the previous two items; 

• forgets unions: This can be shown analogously to the proof of Lemma [TT] 

Clearly, there is an embedding h from £ into 2t. Then / := h{g) is a homomorphism from !B to 21 
that forgets unions. □ 

Proposition 35. Let (p he a quantifier-free first-order formula over the signature {n, u, c, 0, 1}. 

Then e preserves (p over (5 if and only if e preserves (j) over 2t. Moreover, every operation from 
A A that strongly preserves n, 0, and 1 and forgets unions generates e, and is generated by e. 

Proof. Let a be a tuple of elements from A. Clearly, there exists a tuple b of elements from V(N) 
such that a and b satisfy the same set ijj of quantifier-free first-order formulas; this follows from the 
fact that every finite Boolean algebra is the Boolean algebra of subsets of a finite set. Now observe 
that whether or not the tuple e{b) satisfies a quantifier- free first-order formula (j) only depends on 
ip, by Lemma fTTl Since e strongly preserves n, 0, and 1, and forgets unions, the same is true for 
the quantifier- free first-order formulas that hold on e(a). Hence, e preserves (j) over 21 if and only if 
e preserves (p over 6. 

To prove the second part of the statement, we use Theorem l31l Suppose that c and d are tuples 
of elements from A that satisfy the same quantifier-free first-order formulas. By the equivalent 
characterization of cj-categoricity mentioned above, and the fact that 21 has quantifier-elimination, 
there exists an automorphism a of 2t that maps c to d. By the above observations and Theorem [3TJ 
this implies that all operations that strongly preserve n, 0, and 1, and forget unions generate each 
other. □ 

Let ei be the operation {x,y) 1-^ e{i{x,y)). The following can be shown similarly to Proposi- 
tion [35l 

Proposition 36. Let p be a quantifier-free first-order formula over the signature {n, u, c, 0, 1}. 
Then ei preserves <p over 6 if and only if ei preserves <p over 21. Moreover, every binary operation 
g that strongly preserves n, 0, and 1, and forgets unions generates ei, and is generated by ei. □ 

We now give the central argument for the maximal tractability of £/, stated in universal- 
algebraic language. We say that an operation from A''' A depends on the argument i e {1, . . . , fc} 
if there is no (A:— l)-ary operation /' such that for all xi, . . . , Xfc e A 

f{xi, . . . , X^.) = f'{xi, . . . , Xi-i, Xi-i, Xi+i, . . . , Xk) ■ 

We can equivalently characterize fc-ary operations that depend on the i-th argument by requiring 
that there are xi, . . . ,Xk e A and e A such that 

f{xi,. ..,Xk) f{xi,. . . ,Xi-i,Xi,Xi+i, . ..,Xk) ■ 
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Theorem 37. Let f be an operation generated by {ei}. Then either {/} generates ei, or / is 
generated by {e}. 

Proof. To show the statement of the theorem, let / be a fc-ary operation generated by {ei}. For the 
sake of notation, let xi, . . . , a;; be the arguments on which / depends, for / ^ k. Let /': A' ^ A be 
the operation given by f'{xi, . . . , xi) = f{xi, . . . , a:;, . . . , Xi). Clearly, /' must be injective (since it 
is generated from an injective operation and depends on all arguments). Since /' is generated by 
ei it preserves n, 0, 1, and since /' is injective, it also strongly preserves those functions. 

Consider first the case that ^ = 1, i.e., /' is unary. If for all finite subsets of A, the operation /' 
equals an automorphism of 21, then / is generated by Aut(2l) and there is nothing to show. So as- 
sume otherwise; that is, assume that there is a finite set S* c A such that there is no a e Aut(2t) with 
f'{x) = a{x) for all x e S. We claim that /' forgets unions. To see this, let ui, . . . , Um, vi, ■ ■ ■ ,Vn 
be from A such that f'{ui) u • ■ • u f'{u,n) u f'{vi) u • • • u f'{vn) = 1- Since /' is generated by 
{ei}, there is a term composed from automorphisms of 2t and ei such that f'{x) = T{x) for all 
x e S 'u {mi, . . . , Um,vi, ■•■,«„}. By the choice of S, this term cannot be composed of automor- 
phisms alone, and hence there must be o e Aut(2l) and operational terms Ti,T2 composed from 
automorphisms of 21 and ei such that f'{x) = a{ei{Ti{x) , T2{x))) for all x e S. As ei forgets unions, 
there exists an i ^ A; such that Ti{ui) u Ti(wi) u • • • u Ti{vn) = 1- Since Ti strongly preserves n we 
conclude that there exists an i such that u wi u • • ■ u w„ = 1 (see the proof of Proposition [TO)) . 
which is what we wanted to show. By Proposition [35] it follows that /' is generated by e. But then 
/ is generated by e as well. 

Next, consider the case that / > 1. Let g be the binary operation defined by g{x,y) = 
f'{x,y,...,y); since /' is injective, the operation g will also be injective, and in particular de- 
pends on both arguments, and strongly preserves 0, 1, and n. We claim that g forgets unions. 
Let ui = {ul,ul), . . . ,u„t = {u l^^,ul^ ),vi = {vl,vf), . . . ,v„ = {vl^,vl) be from A^ such that 
g{ui) u ■ • • u g{um) u g{vi) u • • • u g{vn) = 1- Since g is generated by ei and cannot be generated by 
the automorphisms of 21 alone, there is a term of the form T{x, y) = a{ei{Ti{x, y), T2{x, y))) where 

• a e Aut(2l), 

• Ti and T2 are operational terms composed from automorphisms of 21 and ei, 

• g{x,y) = T{x,y) for all ix,y) e {ui, . . . , w™, ui, . . . , u„}. 

Since ei forgets unions, there exists an i ^ A; such that Ti{ui) u Ti(vi) u • ■ • u Ti{vn) = 1 and 
T2(ui) u T2{vi) u ■ • ■ u T2{vn) = 1- Supposc first that Ti depends on both arguments. Then Ti 
defines an injective operation and strongly preserves n. It follows that u tJi u • • • u zJ„ = 1 in 
2t^ since these equations are inner Horn. We can argue similarly if T2 depends on both arguments, 
and in those cases we have established that g forgets unions. Suppose now that each of Ti and 
T2 does not depend on both arguments. Consider first the case that Ti only depends on the 
first argument. Then the function x 1— > Ti{x,x) is injective and strongly preserves n, and from 
Ti(uj) u Ti(yi) u • ■ • u Ti{vn) = 1 we derive as above that u\ u v\ u ■ ■ ■ u v\ = 1 holds in 2t. 
In this case, T2 must depend on the second argument, since T depends on both arguments. We 
therefore also have that uf u v\ u ■ ■ ■ u v'^ = 1 holds in 21. The situation that Ti only depends on 
the second argument and T2 only depends on the first argument is analogous. So g forgets unions. 
By Proposition [Ml 9 generates ei. Consequently, also / generates ei. □ 
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Definition 38. Let U,I ^ be the following relations 

U ■. = {{x,y,z) \{xuy = z)} 
I ■. = {{x,y,z) \{xny = z)} 

Note that both U and / are preserved by i. The following demonstrates that when F has the 
polymorphism i, this does not suffice for tractability of CSP(r). 

Proposition 39. Let T be the structure with domain A and three relations U, I, and ^. Then 
CSP(r) is NP-hard. 

Proof. The proof is by reduction from 3SAT. We compute from a given 3SAT instance $ with 
variable set V an instance 4* of CSP(r) (in polynomial time) as follows. There are distinguished 
variables t and / in For each variable x of <& there are two variables Xt and Xf in "i/. For a 
clause C := {^1,^2, ^3} of $ (with literals k either of the form x or of the form -^x) we create a 
fresh variable uc, and add the constraints U (wi, W2, ""c) and U {uc, V3,t) where Vi := Xf if k = -^x, 
and Vi := Xt ii h = x. Moreover, we add for each variable x e V the constraints U{xt,Xf,t) and 
I{xt,Xf, /). Finally, add the constraint t ^ f and I{t, /, /). 

It is clear that if $ has the satisfying assignment a: V {0, 1} then the following assignment 
/3 satisfies all constraints in ^. Choose Sf St arbitrarily. Then /3 maps t to St and / to Sf, it 
maps Xt e V to St if a{x) = 1 and to Sf otherwise, it maps Xf e V to Sf if a(x) = 1 and to St 
otherwise, and for every clause C = {li, I2, h} of $ it maps uc to St if a{li) = 1 or a{l2) = 1, and 
to Sf otherwise. 

Conversely, suppose that /3 maps the variables of to the elements of F satisfying all constraints 
of Let *B be the finite Boolean algebra that is generated by I3{V) in F. Since /?(/) £ f3{t), we 
have that /3(t) is non-empty. Select an arbitrary atom a of *8 that is contained in /3{t). Then we 
set a{x) for a; 6 y to 1 if a c /3{xt) and to otherwise. In this way all clauses {li, I2, h} of $ are 
satisfied. To see this, assume for simplicity of presentation that h = — 'X is negative and h = y and 
I3 = z are positive; the general case is analogous. Since we have the constraints U{xf,yt,uc) and 
U{uc,zt,t), and since a is an atom of *8, one of /3{xf), /3{yt), /3{zt) must contain a. If a is in Plyt) 
or I3{zt) then a{y) or a{z) is set to 1. If a is in I3{xf), then the clause I{xt, Xf, /) forces that a is 
not in l3(xt), and hence a{x) is set to 0. Thus, a sets at least one of -^x,y,z to 1, and the clause 
C is satisfied. □ 

Theorem 40. Let T be a set constraint language. Suppose that F contains all relations from 81 , 
and also contains a relation that is not from £1 . Then there is a finite sublanguage F' of F such 
that CSP(F') is NP-hard. 

Proof. When i?2, ■ • • are the relations of F, let ^2, • ■ • be quantifier-free first-order formulas 
that define i?f , i?2 ' • ■ • '^^^^ ® = ('^(N); u, n, c, 0, 1). Let Rf, i?f , ... be the relations defined by 
(f)i, 4)2, ■ ■ ■ over 21, and let A be the relational structure with domain A and exactly those relations. 
By Proposition 1361 A contains a relation that is not preserved by ez, and contains all relations that 
are preserved by ei. Consider the set T of all polymorphisms of A. 

The set J- does not contain ei, since this would contradict by Theorem[2I]the fact that A contains 
a relation that is not preserved by ei. Since is locally closed, it follows from Theorem 1571 that 
all operations f ^ T are generated by e. But then the relation {(x, y, z) \ x = yi^zyxi^y = z] 
is preserved by all operations in T , and hence pp definable in F by Theorem [29l This relation 
has an NP-complete CSP jBK08] . Let A' be the reduct of A that contains exactly the relations 
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that appear in those pp definitions. Clearly, there are finitely many such relations; we denote the 
corresponding relation symbols by r' c r. By Lemma [28l CSP(A') is NP-hard. 

This establishes also the hardness of CSP(r): let F' be the r'-reduct of F. We claim that 
CSP(F') and CSP(A') are the same computational problem. We have to show that a conjunction 
of atomic r'-formulas $ is satisfiable in F' if and only if it is true in A'. Replacing each atomic 
r'-formula in $ by its quantifier- free first-order definition, this follows from Theorem 1271 □ 

8 Concluding Remarks 

We have introduced the powerful set constraint language of £1 set constraints, which in particular 
contains all Horn-Horn set constraints and all previously studied tractable set constraint languages. 
Constraint satisfaction problems over £1 can be solved in polynomial - even quadratic - time. Our 
tractability result is complemented by a complexity result which shows that tractability of £1 set 
constraints is best-possible within a large class of set constraint languages. 

It is not hard to see from the properties we prove for £1 set constraints that there is an algorithm 
to test whether a given finite set constraint language (where relations in the language are given by 
quantifier-free formulas over the signature {u, n, c, 0, 1}) is contained in £1. This means that the 
so-called meta-problem for £1 set constraints can be decided effectively. 

We would also like to remark that one can analogously obtain tractability for the class of 
constraints where the inner clauses of the positive outer literals are dual Horn (i.e., have at most 
one negative literal). 
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